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METHOD AND APPARATUS FOR INCREMENTAL CONCURRENT 
5 LEARNING IN AUTOMATIC SEMICONDUCTOR WAFER AND LIQUID 

CRYSTAL DISPLAY DEFECT CLASSIFICATION 

The invention relates to a method and apparatus 
for semiconductor wafer defect and liquid crystal 
display defect classification and more particularly to 
10 a method and apparatus for providing incremental 
concurrent learning to an automatic semiconductor 
wafer defect classification system. 

BACKGROUND OF THE INVENTION 
Complex and very expensive automated systems are 
15 available to locate and detect potential anomalies on 
semiconductor wafers and liquid crystal displays 
quickly. However, determining whether these anomalies 
are actually defects and, if so, obtaining their 
precise location, assessing their impact on the 
2 0 performance of the die when packaged, and diagnosing 
their cause still requires manual classification. 
These time-consuming methods are very slow and 
inconsistent. Results vary with time available, 
knowledge, training, ability and fatigue of the 
25 operator. On review, human experts in average agree 
with only 62 percent of visual classifications. 
Experts' agreement with their own classifications 
diminish markedly over time so that at the end of a 
month, agreement with their own classifications can be 
below 50%. These problems, widely observed in diverse 
task environments, are associated with limitations of 
human memory and cognition. To ensure consistent 
quality control for submicron semiconductor 
manufacturing processes and liquid crystal display 
manufacturing processes, automatic defect 
classification, that is directly associated with 
determination of cause, is necessary. 
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Types of macro and micro defects include: Gray 
Spot, Gray Streak, Gray Spot and Gray Streak, 
Particles, Multi-Layer Structure, Line Break, 
Subsurface Line, Scratch, Hillocks, Grass, Worm-hole, 
5 Starburst, Speedboat, Orange Peel, Resist Gel Defect, 
Controlled Collapse Chip Connection (C4) , Microbridge, 
Submicron, Micron, Micron Sphere, U. Pattern, 
Contamination, Protrusion, Break, Intrusion, Nuisance, 
Mask-Related (Shorts) , Haze, Micro-contamination, 
10 Crystalline (Stacking Fault) , Spots, Break, Reticle, 
Hard-Defects (Pinholes, Pindots, Extrusions), Semi- 
Transparent (Resist Residues, Thin Chrome), 
Registration (Oversized, Undersized, Mislocated) , 
Corner, Extra Metal, Metal Missing and Opens (Pattern 

15 Missing) . 

Because of the dynamic nature of the incidence of 
semiconductor defects, establishing a set of defect 
classes or descriptions in advance that covers all 
possible defects is not possible and efficient. The 
20 high degree of customization required for each 
application adds another dimension to the inherent 
difficulties- An automatic defect classification 
system must be able to add and delete defects from a 
set of reference examples as well as to alter or 

25 refine classification rules at any time as production 
circumstances and requirements change. Furthermore, 
new defect classes may arise as the process evolves or 
as new types of equipment are introduced. An 
automatic defect classification system should also be 

30 capable of learning new classes. 

Because operators cannot be economically trained 
to create and edit knowledge bases containing expert 
system rules, the system should advantageously be able 
to acquire such rules directly and automatically from 

35 images of defects selected as examples. Information 
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about previous and related defects, including images 
of them f must be made readily available to technical 
personnel involved in the diagnosis of defects and 
those that take measures to deal with their causes. 
5 The number of defect examples available for each 

class of defect can be quite small. Thus, the defect 
classification and diagnosis system must be able to 
achieve a high level of accuracy with as few as three 
examples for each class of defect, and must be capable 
10 of being quickly and easily modified. 

SUMMARY OF THE INVENTION 
This invention discloses a method and apparatus 
for incremental concurrent learning that allows 
automatic update of the classification rules for 
15 highly robust classification of defects in an 
automatic semiconductor wafer defect and a LCD 
classification environment. The invention not only 
provides automatic defect classification capability 
but also allows for updating of the system to deal 
20 with changing product design and process cycles. The 
system can handle new defect types with a small number 
of defect examples. In addition, the system allows 
for human interaction by not forcing classification of 
questionable objects and can automatically organize 
25 defect examples and display them to a human operator, 
along with similar defects with known types for human 
verification and truth labeling. 

The defect classification results can be encoded 
into profiles for different products, and process 
3 0 cycles for quality control evaluation and yield 
management . 

The invention provides a method of incremental 
concurrent learning comprising the steps of; providing 
potential defects and fabrication information, 
35 providing a classification rule selection from a 
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10 



defect knowledge database from multiple products with 
different process cycles, performing a truth inquiry 
to update a classification rule database for use by 
the classification rule selection, performing a single 
stage classification having at least one 
classification output and at least one confidence 
output, checking the at least one confidence and if 
the confidence of the at least one classification is 
high, defining the classification outcome and 
providing the classification outcome to the 
classification rule selection step, and if the at 
least one confidence is not high, questioning the 
classification outcome and sending the potential 
defects to the novelty defect detection module, and 
15 performing a novelty defect detection step to define 
artifacts or a potential new defect type to provide 
information for the truth inquiry. 

Further, the invention provides that a novelty 
detection step comprises: providing potential novel 
20 defects., performing artifact rejection of potential 
novel defects and generating an artifact database and 
potential novel defect database, and performing a 
clustering of a potential novel defect database to 
provide use of clusters and other objects providing a 
25 truth determination of the defect clusters to update 
the potential novel defect database. 

The invention further performs automatic 
classification rule generation that comprises 
providing defect samples with truth labels where 
3 0 features are selected, performing a sample editing on 
the defect database, and performing a condensing of an 
output of the sample editing. 

The invention performs adaptive learning for the 
initialization of prototypes, performs adaptive 
3 5 learning with a L.VQ method, and generates high and low 
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confidence thresholds for the rules to be generated. 

The invention further performs a truth inquiry 
further comprises the steps of editing and condensing 
a defect cluster, and performing a similarity 
5 detection step to compare known defects to 
representative unknown defects. 

The invention further provides a defect knowledge 
database comprising a plurality of products wherein 
each product has a processing layer where each 
10 processing layer has a plurality of defect types 
wherein each defect type has a plurality of rules 
wherein each rule has a prototype. 

The invention detects defects on semiconductor 
wafers or liquid crystal displays. 
15 Other objects, features and advantages of the 

present invention will become apparent to those 
skilled in the art through the description of the 
preferred embodiment, claims and drawings herein 
wherein like numerals refer to like elements. 
20 BRIEF DESCRIPTION OF THE DRAWINGS 

To illustrate this invention, a preferred 
embodiment will be described herein with reference to 
the accompanying drawings . 

Figure 1 shows the method and apparatus of the 
25 invention. 

Figure 2 shows the training and application phase 
of the automatic wafer defect classification system. 

Figure 3 shows the method of the invention to 
process potential defects by using fabrication 
3 0 information to control incremental concurrent 
learning. 

Figure 4 shows the processing flow diagram of the 
invention to implement the novelty detection step. 

Figure 5 shows a process flow diagram of the 
3 5 truth inquiry module of the invention. 
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Figure 6 shows the classification rule generation 
of the invention. 

Figure 7 shows the structure of the defect 
knowledge database of the invention. 
5 Figure 8 shows the learning vector quantization 

method of the invention as applied to one example in 
feature space. 

Figure 9 shows the method of the invention to 
generate a transformed feature list . 
10 Figures 10A and 10B show one embodiment of a 

method of the invention for periodic sampling and 
classification rule confirmation or update. 

DETAILED DESCRIPTION OF THE PREFERRED EMBODIMENT 
Refer now to Figure 1 which shows the method and 
15 apparatus of the invention. The apparatus for 
incremental concurrent learning in automatic 
semiconductor wafer and liquid crystal display defect 
classification comprises a wafer storage and loading 
module 18 and a wafer storage and unloading module 20, 
20 where both can be combined into one module, to 
transport a wafer 15 to and from an automated stage 
16. The automated stage 16 moves the wafer 15 under 
a microscope 17 for scanning. The invention further 
provides a light source 14 and a sensing device 12 to 

2 5 image the area of the wafer 15 under inspection. A 

processing unit 22 comprises a host computer 21 , 
multiple controllers 23, a high speed image processing 
unit 25 and a power supply 27. The sensing device 
sends a raw image 24 to processing unit 22 through 

3 0 connection 19. The processing unit 22 controls the 

scanning of the wafer 15 and performs the image 
decomposition and synthesis processing for automatic 
inspection. The high speed image processing unit 25 
operates to facilitate the information extraction 
35 process by implementing the decomposition and 
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synthesis method in software. The method decomposes 
the raw image data 24 into a set of partial 
information channels, each channel reflects certain 
aspects, a modality, of the information. The partial 
5 information can thus be processed independently or 
cooperatively and synthesized to recover the original 
image as recovered image 24A or to construct an 
improved image 33. In this manner, useful information 
can be retained and irrelevant information can be 

10 rejected effectively. 

Refer now to Figure 2 which shows the training 
and application phase of the automatic wafer defect 
classification system 10. In the training phase, 
defect images with truth labels 40 are input to a 

15 context sensitive segmentation step 42. The output of 
the context sensitive segmentation step 42 is input to 
the extraction step 44, which extracts a set of 
comprehensive features from the segmented image. The 
output of the extraction step 44 is sent to an 

20 automatic feature selection and transformation step 46 
that provides an application feature set 60 used in 
the application phase. The automatic feature 

selection and transformation step 46 provides an input 
to the automated classification rule generation step 

25 48. After the automated classification rule 

generation step 48, the initial classification rules 
62 are sent to the application phase. In the 
application phase, potential defects 50 are provided 
to a context sensitive segmentation step 52. The 

3 0 output of the context sensitive segmentation is a 
segmented image. Both the segmented images and the 
defect images are provided to feature extraction step 
54. The feature extraction step takes the application 
feature step 16 and provides information for future 

35 classification in step 56. The output . of feature 
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classification step 56 is defect classifications 58. 

In operation, the defect classification system of 
the automatic wafer defect classification system 
implements a training phase and an application phase. 
• 5 In the training phase, knowledge is acquired directly 
from defect images 40. Sufficient examples of defect 
images with truth labels are processed by the system. 
The defect objects 40 are first segmented 42 and a set 
of comprehensive features are extracted 44 from the 

10 images of the segmented objects. The extracted 
features with object truth labels are used to select 
the significant feature set 60 for further analysis. 
Using the significant feature set 60, new features are 
generated by feature transformation methods. Based on 

15 the selected and transformed features, classification 
rules are automatically generated 4 8 for defect 
classifications . 

Refer now to Figure 9 which shows the method of 
the invention to generate transformed features. The 

2 0 method starts by providing a potential feature list 

210. Statistical feature selection is performed in 
step 212. In one embodiment of the invention, the 
stepwise discriminant method is used to do feature 
selection by evaluating the significance of F-test 
25 statistics from an analysis of covariance. The output 
of the statistical feature selection is a significant 
feature set 214. The significant feature set 214 is 
then used by four methods to generate potential 
features for evaluation in step 224. The feature 

3 0 evaluation process 224 evaluates the effectiveness of 

the transformed features from the transformation 
methods by computing the areas under a receiving 
operating curve. The larger the area, the greater the 
effectiveness of the transformed feature. The output 
35 of the process is a list of selected transformed 
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features 226. 

The first method is the multilayer perceptron 
(MLP) 216. MLP is one type of ANN using unsupervised 
back-propagation method to learn the network weights. 
5 Given the selected features and defect samples with 
true labels, back-propagation learning method is 
applied to determine the network weights. These 
learned weights define the non-linear formula to 
transform the feature set into transformed features. 
10 The number of transformed features are usually much 
smaller than the original features. 

The second method is the group method of data 
handling GMDH 218. GMDH is a method to model complex 
output (transformed features) by input features using 
15 recursive generations of linear or nonlinear 
regression molds using pairs of features or derived 
features as input. 

The third method is the rule based transfer 
method 22 0. 

20 The fourth method is the statistical linear 

projection 222. This method computes multiple 

discriminant analysis on C-class defect data. The 
method is a generalization of the Fisher's Linear 
Discriminant involving C-l discriminant functions. 

25 The Fisher's Linear Discriminant is a well-known 
statistical method for feature projection. 

In the application phase, the potential defects 
50 are automatically segmented 52 from their images. 
The features of the objects are extracted 54 and the 

30 classification rules are applied to classify 56 the 
potential defects into artifacts or different defect 
types 58. If no definite outcome can be achieved, the 
system provides a questionable outcome label to alert 
a human operator that manual analysis may need to be 

3 5 performed. 
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In a preferred embodiment of the invention, the 
object segmentation method disclosed in the "Method 
for Identifying Objects Using Data Processing 
Techniques", US Patent No. 5,528,703 can be used for 
5 object segmentation, such as that performed in steps 
42 and 52. The comprehensive feature set 60 includes 
single features such as size, shape, texture, 
contrast, color, etc., pairs of features and their 
relationships such as connect, adjacent, touch, 

10 inside, outside, etc., and multiple features with 
higher level relationships. 

In one embodiment, the feature selection method 
4 6 comprises methods such as the stepwise discriminate 
analysis. Stepwise discriminate analysis can be used 

15 to select the features having the highest discriminate 
power for classification in a well known manner. 

In one embodiment, the feature transformation 
step 46 can be accomplished by statistical feature 
extraction, methods such as Group Method of Data 

20 handling, among others. The selected features along 
with the transformed features are used to train the 
defect classifier. The Classification And Regression 
Trees (CART) can be automatically generated and be 
used as the basis for the classifiers. Since there 

2 5 may only be small number of samples per defect type 

for training and incremental learning, this capability 
is required for this application, other classification 
methods such as k nearest neighbor rule (k-NN) or its 
derivatives can be used to better provide the desired 

3 0 functions. 

As memory and computational capacity increase, 
for example, exponentially in the past decade, and if 
the same growth pace is expected for the next decade, 
non -parametric supervised classification rules such as 
3 5 the k nearest neighbor rule (k-NN) which, are memory 
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and computational demanding yet very flexible, become 
feasible for semiconductor defect classification. 
However, the requirement of a large training set to 
ensure convergence of the K-NN rule is the main 
5 drawback of the nearest neighbor rules in practical 
problems. Moreover, non -parametric supervised 

classification rules can be negatively influenced by 
incorrectly labeled training samples, "noisy" samples 
or outliers. In a preferred embodiment of this 
10 invention, the K-NN rule is enhanced by an 
editing-condensing method and adaptive learning 
method. 

The editing- condensing method improves the 
accuracy of the classification by removing samples 
15 located in overlapping acceptance surfaces, the 
editing method, and reduces the computational effort 
required to find the nearest neighbor (s), the 
condensing method. After a training set is edited and 
condensed, an adaptive learning method can be used. 

2 0 The adaptive learning method uses the training samples 

to tune a set of training examples known as 
prototypes. Adaptive learning is performed in two 
sequential phases: initialization and learning. The 
prototype set is initially a subset of the training 
25 set and the values of the prototypes are updated in an 
interactive learning process. Two different 

approaches may be used: the first is based on the use 
of learning vector quantization, LVQ, and its 
variations and the second is the decision surface 

3 0 mapping method, DSM. 

In LVQ learning, the location of the prototypes 
in the representation space approximates the 
underlying probability densities using a number of 
updated prototypes, where updating occurs during 
3 5 training using a punishment -reward criterion. In one 
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embodiment, a variation of the adaptive learning 
method learns the variable number of prototypes by 
adding or deleting the number of prototypes in the 
initial prototype set based on the nearest distance 
criteria is used. 

The DSM places the prototypes, that approximate 
the decision boundaries, in the representation space. 
In both LVQ and DSM, the 1-NN classifier is used with 
the prototype set as the reference set. 

In another embodiment, a nonparametric automatic 
rule generator based on a decision-tree method is used 
to generate classification rules. 

Refer now to Figure 8 which shows the learning 
vector quantization method of the invention as applied 
15 to one example in feature space. Two features are 
shown, feature fl 205 and feature f2 206. Two types 
of known objects, object x 202 and object o 203 and 
potentially new objects 204 are also shown. Figure 8 
shows that a cluster 207 of potentially new objects 
2 0 204 that do not meet the criteria for an object x 202 
nor an object o 203 may be grouped as a new object 
type. Different training sets are generated for 
defects of different products and different processing 
cycles. The editing and LVQ methods are applied to 
25 each of the different training sets to generate 
prototype sets of the different classifiers. 

After the prototype set is generated for a 
classifier, the original training sets are re-applied 
to the classifier and the difference between each 
30 training sample and its nearest prototype of the same 
defect type is recorded. The distributions of the 
difference values are used to generate a high 
confidence threshold and a low confidence threshold 
for each defect type of the classifier. Any object 
35 with a feature difference to the nearest prototype 
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less than the high confidence threshold can be 
confidently classified as a defect of the same type as 
the nearest prototype. Any object with a feature 
difference to the nearest prototype greater than the 
5 low confidence threshold cannot be confidently 
classified to the defect type of the nearest 
prototype. This is the case for unknown objects 204. 

Figure 3 shows the method of the invention to 
process potential defects by using fabrication 
10 information to control incremental concurrent 
learning. Potential defects 60 are provided to defect 
classification subsystem 61. The defect 

classification subsystem 61 comprises a single 
classifier or a multistage classifier. In one 
15 embodiment, the multistage classifier comprises two 
classifiers: primary classifier 62 and secondary 
classifier 66. In an alternate embodiment, a single 
classifier 61 comprises the primary classifier 62. 
Primary defect classification step 62 makes a 
20 determination in step 64 of whether or not the primary 
defect classification step 62 has provided an output 
having high confidence. If the output has high 
confidence, the classification determines a definite 
classification output 68. The definite classification 
25 output 6 8 may be provided to step 8 9 for periodic 
sampling and review. 

If there is not high confidence in the primary 
defect classification step 64, the process flows to 
step 66 to determine the secondary defect 
30 classification in step 66. The process then 

determines whether or not the secondary defect 
classification is of high confidence in step 70. If 
the secondary defect classification is of high 
confidence, the process flows to step 74 to determine 
35 a definite classification outcome. The definite 
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classification output 68 may be provided to step 89 
for periodic sampling and review. In step 89, the 
method performs periodic sampling and review for rule 
confirmation or update as shown in greater detail with 
5 respect to Figures 10A and 10B. Rule confirmation or 
update from step 8 9 may be provided as input to the 
defect knowledge database 8 6 or to the primary 
classification rule and secondary classification rule 
selection 84. 

10 If the confidence is not high in step 70, the 

process flows to step 72 to reconcile the questionable 
classification outcome. The process then flows to 
step 7 6 to perform novel defect detection. In outcome 
78, the artifact classification outcome is defined 

15 with the novel defect detection step 76 . The result 
of the novel defect detection step 76 is also provided 
to a truth inquiry step 80 which provides a potential 
new defect type. In step 70, if the confidence for 
the classification was high, the classification result 

20 is provided to step 84 which also receives fabrication 
information input 61. The primary classification rule 
and secondary classification rule selection step 84 
provides primary classification and secondary 
classification rules to the primary defect 

25 classification step 62. In step 88, humans are 
queried as to the truth of the potential new defect 
type. If the potential new defect type is indeed a 
real new defect type, the true defect types and 
classification rules in the defect knowledge database 

30 are updated in step 82. In classification rule update 
82, the primary classification rule and secondary 
classification rules selected in step 84 are updated 
to include the new defect type rules for subsequent 
classification. The defect knowledge database 86 

3 5 receives the output of the classification rule update 
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In operation, the incremental concurrent learning 
module takes the inputs from the fabrication 
information system and selects a primary 
classification module and a secondary classification 
5 module from the database based on the current product 
and process cycle. The fabrication information can 
include: computer-aided design or CAD files, wafer 
electrical test results, histories of defects, defects 
on previous layers of the wafer, patterns of defects 

10 for an entire lot or production run, process models 
and histories, or wafer processing histories, among 
others. The rule selection could be done by a simple 
indexing scheme or more complicated expert system rule 
based approach. The result of the selection 

15 determines the primary defect classification module 
and secondary defect classification selection module 
84. 

The potential defect 60 is first classified by 
the primary classification module 62 . If the 

20 resulting classification confidence is high, a 
definite classification outcome can be determined 68 
and the classification process for this object is 
completed at this point and the system is ready for 
the next defect. If the classification confidence is 

25 low, the object will be classified by the secondary 
classification module 66. If the resulting 

classification confidence is high, a definite 
classification outcome can be determined in step 74. 
The classification process is completed and the system 

30 is ready for the next defect. If the classification 
confidence is low, the object could represent a new 
defect type and the object will be processed by the 
novelty detection module 76 . The novelty detection 
module 76 could reject the object as an artifact or 

35 consider the object questionable and activate further 
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novelty detection processes. After a sufficient 
number of the potential new defects are accumulated in 
the database, the novelty detection module 76 will 
attempt to identify a group of similar potential 
5 defects. A truth inquiry 80 is applied to the defect 
cluster. After the truth is determined, the defect 
knowledge database 86 and the classification rules 82 
are updated using the new objects with their truth 
labels provided by human experts. In one embodiment 

10 of the invention, the defect classification results 
can be encoded into profiles for different products, 
and process cycles for quality control evaluation and 
yield management. Step 89 is a defect type sampling 
and review process for periodical rule confirmation or 

15 update. This module samples and reviews the defect 
classification outcome from the classification 
subsystem. 

Refer now to Figures 10A and 10B which show one 
embodiment of a method of the invention for periodic 

2 0 sampling and classification rule confirmation or 

update. The method of the invention determines a 
graph 3 00 according to features f 1 and f 2 having a 
plurality of defect types 302, 304, 306. Defect types 
306 may be initially classified with defect types 304. 

25 Random sampling review processes subsequently detect 
that defect objects 3 06 represent a new type of 
defect. The method selects a clustering method to 
select defect objects 304 and 306 for classification 
update. Using feature selection and transf ormation 

30 methods discussed in further detail with respect to 
Figure 9, the method determines a new feature set, f 3 
and f 4 . The method then determines a graph 310 with 
the new feature set to distinguish the defect types 
304, 308. The method then updates the rules in the 

3 5 database. The new feature set is added to the feature 
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within the database. 

The method for primary classification is similar 
to the method for secondary classification but use a 
different set of rules and different defect features. 
5 Given the fabrication information as an input, i.e., 
what product, what layer/process, the associated set 
of rules for primary said secondary classification are 
retrieved from the database to prepare for 
classification. Depending on the set of rules 

10 selected, their associated set of defect feature 
computation modules are enabled in the primary or 
secondary defect classification module. 

In one example embodiment, each potential defect 
image is input to the module, features required for 

15 classification, such as size of the defect, shape, 
contrast, chromatic information, connectivity, 
adjunct, relative position (in/out) surrounding 
defect, etc., are computed. After the features are 
computed, the classification method such as LVD, K-NN 

20 is applied. The invention computes the feature 
distance between the defect features and all 
prototypes, the nearest prototype type and confidence 
are obtained. A prototype can be considered as a 
specific rule regarding the feature space . 

25 One example of a Defect Rule Knowledge DataBase: 

entries: product name, 

layer/process, 

defect type (f eature__namel , value) . . . 

f eature_nameN, value) . 
3 0 Refer now to Figure 4 which shows the processing 

flow diagram of the invention to implement the novelty 
detection step. Potential novel defects 104 are 
provided so that artifacts 94 may be rejected from the 
potential novel defect database 92. The potential 
3 5 novel defect database 92 is, in one embodiment, a 
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fast -accessing database storing the set of detected 
potential novel defect samples, where each has a set 
of data including feature name and computed values for 
that product, layer/process. Other database types may 
5 also be used. The artifact rejection step 90 updates 
the potential novel defect database 92. The potential 
novel defect database 92 also has a human 
determination input from a truth inquiry, step 96. 
The human truth determination step 96 also receives 

10 defect clusters 100. In step 98, the output of a 
potential novel defect database 92 is clustered and 
the clustering method provides the defect clusters 10 0 
with other types of objects 102. The updating 
database phase removes the objects in defect clusters 

15 from the potential novel defect database 92. An 
object with low classification confidence could 
represent an instance of a novel defect type. The 
object could also be simply an artifact. All objects 
with low classification confidence are processed by 

20 the novelty detection module. The module first 
applies an artifact rejection stage to the input 
objects. If an object is determined to be an 
artifact, no further action is required. Otherwise, 
the object is used to update the potential novel 

25 defect database 92 and evaluates the similarity of the 
potential novel defect objects in the database by a 
clustering method 98, If no significant clusters of 
similar objects can be identified, the objects 102 are 
most likely artifacts caused by noise or random 

3 0 variations and no potential new defect types are 
detected. On the other hand, if defect clusters can 
be identified having multiple objects with similar 
feature profiles, the module has discovered potential 
new defect types. In this case, the module selects a 

35 subset of the most representative objects, for human 
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truth input. After human truth determination 96, the 
module updates the potential novel defect database 92 
to remove the objects with known truth from the 
database . 

5 The method for artifact rejection is very similar 

to the defect type classification. The invention uses 
a set of artifact classification rules developed for 
each product, layer/process, by the x nearest 
neighbor, X-NN method. However, there is no artifact 

10 type required in the artifact knowledge database. 

In one embodiment of the invention, the potential 
novel defect database 92 stores potential new defects 
from multiple wafers and spans different products and 
processing cycles for evaluation. Unsupervised 

15 clustering methods are used to determine the 
significant clusters from the database. In one 
embodiment of the invention, the clustering step 98 
uses isodata algorithm, fuzzy isodata algorithm, and 
the k means clustering algorithm, fuzzy c means 

2 0 clustering. These methods are based on the distance 
metric in feature space. To make sure only 

significant clusters are detected, a predetermined 
minimum difference threshold is required to qualify 
objects to include in a cluster and a cluster has to 

25 include at least a predetermined number of objects, 
for example ten, before the cluster will be 
considered. 

Refer now to Figure 5 which shows a process flow 
diagram of the truth inquiry module of the invention. 

30 After a defect cluster 118 is identified, the truth 
inquiry module applies an editing-condensing method 
12 0 to select a subset of the defects as 
representative ones to be shown to human for truth 
inquiry in step 96. To enhance the human input 

35 process 126, known defects 123 in the defect database 
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are compared, in a similarity detection step 124, with 
the selected unknown objects 121 and a predetermined 
number of the known defects which are most similar to 
the unknown objects are identified and are shown to 
5 the human 128 with their known defect types first 
before acquiring human truth for the new defects. 

After human input 126, if new defect types are 
identified or if the objects represent a significant 
deviation from known defect types, the information 

10 will be entered into the defect knowledge database 122 
and the classification rule will be updated using the 
editing-condensing and LVQ procedures of the new or 
existing defect classes. If the objects are 

determined to be artifacts, the artifact rejection 

15 stage 90 in Figure 4 will be updated to incorporate 
the new artifacts for rejection. 

Now refer to Figure 6 which shows the 
classification rule generation of the invention. The 
defect samples 106 with truth labels and feature 

20 selections are provided to a sample editing step 108. 

A k nearest neighbor, K-NN, condensing step is then 
performed in step 110. In step 112, adaptive learning 
is performed to initialize prototypes and in step 114 
the adaptive learning uses the LVQ method. In step 

25 116, the method generates high and low confidence 
thresholds. The high confidence threshold is 

predetermined based on the sensitivity and specificity 
requirements of the inspection. 

The invention indexes the defect knowledge 

3 0 database by product, layer and process to select the 
appropriate set of rules for classification. Now 
refer to Figure 7 which shows the defect knowledge 
database of the invention. The knowledge database is 
structured at a first level with a definition of a 

3 5 number of products, for example product 1, product 2, 
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through product N. Each product has an associated 
processing layer, each processing layer having 
multiple defect types known as defect type 1, defect 
type 2, and defect type N; where each defect type has 
5 a number of rules, each rule having an associated 
prototype. Two different products may share certain 
layer or process information. Likewise two different 
layers may share the same defect type information. 
The same prototype may be shared by two or more 
10 different products. 

The following patents and patent applications are 
incorporated by reference hereto : 

U.S. Patent No. 5,787,188, issued 7/28/98, 
entitled "Method For Identifying Normal Biomedical 
15 Specimens" to Nelson et al., which is a file wrapper 
continuation of abandoned U.S. Patent Application 
Serial No. 07/838,064, filed February 18, 1992. 

U.S. Patent No. 5,528,703, issued June 18, 1996, 
entitled "Method For Identifying Objects Using Data 
20 Processing Techniques" to Lee, which is a file wrapper 
continuation of abandoned U.S. Patent Application 
Serial No. 07/838,395, filed February 18, 1992. 

U.S. Patent No. 5,315,700, issued May 24, 1994, 
entitled "Method And Apparatus For Rapidly Processing 
2 5 Data Sequences" to Johnston et al. 

U.S. Patent No. 5,361,140, issued November 1, 
1994, entitled "Method and Apparatus for Dynamic 
Correction of Microscopic Image Signals" to Hayenga et 
al. 

30 U.S. Patent Application Serial No. 08/302,355, 

filed September 7, 1994, for which the issue fee has 
been paid, entitled "Method and Apparatus for Rapid 
Capture of Focused Microscopic Images" to Hayenga et 
al., which is a continuation-in-part of abandoned U. S . 

35 Patent Application Serial No. 07/838,063- filed on 
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February 18, 1992. 

U.S. Patent No. 5,757,954, issued May 25, 1998, 
entitled, "Field Prioritization Apparatus and Method" 
to Kuan et al. 

5 Pending U.S. Patent Application Serial No. 

08/927,379, filed September 12, 1997, entitled 
"Apparatus for Automated Identification of Cell 
Groupings on a Biological Specimen" to Wilhelm et al. , 
which is a file wrapper continuation of abandoned U.S. 
10 Patent Application Serial No. 08/309,061, filed 
September 20, 1994. 

Pending U.S. Patent Application Serial No. 
08/969,970, filed November 13, 1997, entitled 
"Apparatus for Automated Identification of Thick Cell 
15 Groupings on a Biological Specimen" to Meyer et al . , 
which is a file wrapper continuation of abandoned U.S. 
Patent Application Serial No. 08/309,116, filed 
September 20, 1994. 

U.S. Patent No. 5,787,189, issued 7/28/98, 
20 entitled "Biological Analysis System Self Calibration 
Apparatus" to Lee et al . , which is a file wrapper 
continuation of abandoned U.S. Patent Application 
Serial No. 08/3 09,115, filed September 20, 1994. 

Allowed U.S. Patent Application Serial No. 
25 08/678,124, filed July 11, 1996, for which the issue 
fee has been paid, entitled "Apparatus for 
Identification and Integration of Multiple Cell 
Patterns" to Lee et al., which is a file wrapper 
continuation of abandoned U.S. Patent Application 
30 Serial No. 08/308,992, filed September 20, 1994. 

U.S. Patent No. 5,627,908, issued May 6, 1997, 
entitled "Method for Cytological System Dynamic 
Normalization" to Lee et al. 

U.S. Patent No. 5,63 8,459, issued June 10, 1997, 
35 entitled "Method and Apparatus for Detecting a 



SUBSTITUTE SHEET (RULE 26) 



WO 99/01842 



PCT/US98/13670 



- 23 - 

Microscope Slide Coverslip" to Rosenlof et al . 

U.S. Patent No. 5,566,249, issued October 15, 
1996, entitled "Apparatus for Detecting Bubbles in 
Coverslip Adhesive" to Rosenlof et al . 
5 Pending U.S. Patent Application Serial No. 

08/867,017, filed June 3, 1997, entitled "Cytological 
Slide Scoring Apparatus" to Lee et al . , which is a 
file wrapper continuation of abandoned U.S. Patent 
Application Serial No. 08/309,931, filed September 20, 
10 1994. 

Allowed U.S. Patent Application Serial No. 
08/309,250, filed September 20, 1994, for which the 
issue fee has been paid, entitled "Apparatus for the 
Identification of Free-Lying Cells" to Lee et al. 
15 U.S. Patent No. 5,740,269, issued April 14, 1998, 

entitled "A Method and Apparatus for Robust Biological 
Specimen Classification" to Oh et al. 

U.S. Patent No. 5,715,327, issued February 3, 
1998, entitled "Method and Apparatus for Detection of 
20 Unsuitable Conditions for Automated Cytology Scoring" 
to Wilhelm et al. 

U.S. Patent No. 5,797,130, issued 8/18/98, 
entitled "Method for Testing Proficiency in Screening 
Images of Biological Slides" to Nelson et al., which 
25 is a file wrapper continuation of abandoned U.S. 

Patent Application Serial No. 08/153,293, filed 
November 16, 1993. 

U.S. Patent No. 5,699,794, issued December 23, 
1997, entitled "Apparatus for Automated Urine Sediment 
3 0 Sample Handling" to Fleck. 

Pending U.S. Patent Application Serial No. 
08/485,182, filed June 7, 1995, entitled "Interactive 
Method and Apparatus for Sorting Biological Specimens" 
to Lee et al . 

35 U.S. Patent No. 5,647,025, issued July 8, 1997, 
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entitled "Automatic Focusing of Biomedical Specimens 
Apparatus" to Frost et al. 

U.S. Patent No. 5,677,762, issued October 14, 

1997, entitled "Apparatus for Illumination 
5 Stabilization and Homogenization" to Ortyn et al., 

which is a file wrapper continuation of abandoned U.S. 

Patent Application Serial No. 08/309,064, filed 

September 20, 1994. 

Pending U.S. Patent Application Serial No. 
10 08/697,480, filed August 26, 1996, entitled 

"Biological Specimen Analysis System Processing 

Integrity Checking Apparatus" to Ortyn et al., which 

is a file wrapper continuation of abandoned U.S. 

Patent Application Serial No. 08/309,249, filed 
15 September 20, 1994. 

U.S. Patent No. 5,715,326, issued February 3, 

1998, entitled "Cytological System Illumination 
Integrity Checking Apparatus and Method" to Ortyn et 
al. 

20 U.S. Patent No. 5 , 581 , 631 , issued December 3, 

1996, entitled "Cytological System Image Collection 
Integrity Checking Apparatus" to Ortyn et al . 

U.S. Patent No. 5,557,097, issued September 17, 

1996, entitled "Cytological System Autofocus Integrity 
25 Checking Apparatus" to Ortyn et al . 

U.S. Patent No. 5,499,097, issued 3/12/96, 
entitled "Method and Apparatus for Checking Automated 
Optical System Performance Repeatability" to Ortyn et 
al. 

30 u.S. Patent No. 5,692,066, issued November 25, 

1997, entitled "Method and Apparatus for Image Plane 
Modulation Pattern Recognition" to Lee et al . 

U.S. Patent No. 5,799,101, which will ■ issue 
8/25/98, entitled "Method and Apparatus for Highly 
35 Efficient Computer Aided Screening" to Lee et al . , 



SUBSTITUTE SHEET (RULE 26) 



WO 99/01842 



PCTAJS98/13670 



10 



15 



- 25 - 

which is a file wrapper continuation of abandoned U.S. 
Patent Application Serial No. 08/315,719, filed 
September 30, 1994. 

U.S. Patent No. 5,787,208, issued 7/28/98, 
entitled "Image Enhancement Method and Apparatus" to 
Oh et al. 

U.S. Patent No. 5,642,441, issued June 24, 1997, 
entitled "Apparatus and Method for Measuring Focal 
Plane Separation" to Riley et al . 

U.S. Patent No. 5,625,706, issued April 29, 1997, 
entitled "Method and Apparatus for Continuously 
Monitoring and Forecasting Slide and Specimen 
Preparation for a Biological Specimen Population" to 
Lee et al . 

U.S. Patent No. 5,745,601, issued April 28, 1998, 
entitled "Robustness of Classification Measurement 
Apparatus and Method" to Lee et al . 

U.S. Patent No. 5,6 71,288, issued September 23, 
1997 entitled "Method and Apparatus for Assessing 
Slide and Specimen Preparation Quality" to Wilhelm et 
al. 

U.S. Patent No. 5,621,519, issued April 15, 1997, 
entitled "Imaging System Transfer Function Control 
Method and Apparatus" to Frost et al . 

U.S. Patent No. 5,619,428, issued April 8, 1997 
entitled "Method and Apparatus for Integrating An 
Automated System to a Laboratory" to Lee et al. 

U.S. Patent No. 5,781,667, issued July 14, 1998 
entitled "Apparatus for High Speed Morphological 
3 0 Processing" to Schmidt et al. 

U.S. Patent No. 5,642,433, issued June 24, 1997, 
entitled "Method and Apparatus for Image Contrast 
Quality Evaluation" to Lee et al . 

U.S. Patent No. 5,710,842, issued January 20, 
35 1998, entitled "Method for Identifying Obj.ects Using 
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Data Processing Techniques" to Lee, which is a 
divisional of U.S. Patent No. 5,528,703, issued June 
18, 1996. 

Pending U.S. Patent Application Serial No. . 
5 08/788,239, filed January 25, 1997, entitled "Method 
and Apparatus for Alias Free Measurement of Optical 
Transfer Function" to Oh et al . 

U.S. Patent No. 5,654,535, issued August 5, 1997, 
entitled "Cytological System Autofocus Integrity 
10 Checking Apparatus" to Ortyn et al., which is a 
divisional of U.S. Patent No. 5,557,097, issued 
September 17, 1996. 

Allowed U.S. Patent Application Serial No. 
08/784,316, filed January 16, 1997, for which the 
15 issue fee has been paid, entitled "Method and 
Apparatus for Detecting a Microscope Slide Coverslip" 
to Rosenlof et al . , which is a divisional of U.S. 
Patent No. 5,638,459, issued June 10, 1997. 

Pending U.S. Patent Application Serial No. 
20 08/767,457, filed December 16, 1996, entitled "Method 
and Apparatus for Efficacy Improvement in Management 
of Cases With Equivocal Screening Results" to Lee et 
al. 

Pending U.S. Patent Application Serial No. 
25 08/821,699, filed March 20, 1997, entitled 
"Cytological System Autofocus Integrity Checking 
Apparatus" to Ortyn et al . , which is a divisional of 
U.S. Patent No. 5,654,535, issued August 5, 1997. 

U.S. Patent No. 5,760,3 87, issued June 2, 1998, 
30 entitled "Cytological System Autofocus Integrity 
Checking Apparatus" to Ortyn et al . , which is a 
divisional of U.S. Patent No. 5,654,535, issued August 
5, 1997. 

U.S. Patent Application Serial No. 08/823,793, 
35 filed March 20, 1997, for which the issue fee has been 
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paid, entitled "Cytological System Autofocus Integrity 
Checking Apparatus" to Ortyn et al., which is a 
divisional of U.S. Patent No. 5,654,535, issued August 
5, 1997. 

U.S. Patent No. 5,763,871, issued June 9, 1998, 
entitled "Cytological System Autofocus Integrity 
Checking Apparatus" to Ortyn et al., which is a 
divisional of U.S. Patent No. 5,654,535, issued August 
5, 1997. 

Allowed U.S. Patent Application Serial No. 
08/821,694, filed March 20, 1997, entitled 
"Cytological System Autofocus Integrity Checking 
Apparatus" to Ortyn et al . , which is a divisional of 
U S. Patent No. 5,654,535, issued August 5, 1997. 

Allowed U.S. Patent Application Serial No. 
08/816,837, filed March 13, 1997, for which the issue 
fee has been paid, entitled "Astigmatism Measurement 
Apparatus and Method Based on a Focal Plane Separation 
Measurement" to Riley et al . , which is a divisional of 
20 U.S. Patent No. 5,642,441, issued June 24, 1997, 
entitled "Apparatus and Method for Measuring Focal 
Plane Separation." 

Pending U.S. Patent Application Serial No. 
08/888,115, filed July 3, 1997, entitled "Method and 
Apparatus for Maskless Semiconductor and Liquid 
Crystal Display Inspection" to Lee et al . 

Pending U.S. Patent Application Serial No. 
08/888,120, filed July 3, 1997, entitled "Method and 
Apparatus for A Reduced Instruction Set Architecture 
for Multidimensional Image Processing" to Lee et al . 

Pending U.S. Patent Application Serial No. 
08/888,119, filed July 3, 1997 entitled "Method and 
Apparatus for Incremental Concurrent Learning in 
Automatic Semiconductor Wafer and Liquid Crystal 
35 Display Defect Classification" to Lee et al. 
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Pending U.S. Patent Application Serial No. 
08/888,116, filed July 3, 1997, entitled "Method and 
Apparatus for Semiconductor Wafer and LCD Inspection 
Using Multidimensional Image Decomposition and 
5 Synthesis" to Lee et al. 

All of the above patent applications and patents 
are incorporated herein, in their entirety, by the 
foregoing references thereto. 

The invention has been described herein in 

10 considerable detail in order to comply with the Patent 
Statutes and to provide those skilled in the art with, 
the information needed to apply the novel principles 
and to construct and use such specialized components 
as are required. However, it is to be understood that 

15 the invention can be carried out by specifically 
different equipment and devices, and that various 
modifications, both as to the equipment details and 
operating procedures, can be accomplished without 
departing from the scope of the invention itself. 

20 What is claimed is: 
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CLAIMS 

A method for incremental concurrent learning in 
automatic defect classification comprising the 
steps of : 

(a) providing defect images with truth labels 
(40) ; 

(b) performing a context sensitive segmentation 
to produce a segmented image output (4 2) ; 

(c) extracting a set of comprehensive features 
from a segmented image output to provide a 
feature output (44) ; 

(d) performing automatic feature selection (46) 
and transformation of the feature output 
having an application feature set output 
(60) ; 

(e) automatic classification rule generation 
(48) to provide classification rules for 
automatic feature selection and 
transformation and to provide initial 
classification rules (62) ; 

(f ) providing new potential defects (50) ; 

(g) performing a context sensitive segmentation 
of the new potential defects (52) ; 

(h) performing feature extraction of potential 
defects (54) ; and 

(i) performing feature classification (56) of 
the extracted potential defects using the 
initial classification rules (62) to provide 
defect classification (58) . 

The method of claim 1 further comprising the 
steps of: 

(a) providing potential defects (60) and 
fabrication information (63) ; 

(b) providing a primary classification rule and 
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secondary classification rule selection (84) 
from a defect knowledge database (86) from 
multiple products with different process 
cycles; 

(c) performing primary defect classification 
having a classification output (62) ; 

(d) checking a confidence of the classification 
output to decide whether to perform a 
secondary defect classification (64) ; 

(e) if the confidence is not high, performing a 
secondary defect classification (66) ; 

(f ) checking the confidence of a secondary 
defect classification (70) and if the 
confidence of the secondary defect 
classification is high, defining a 
classification outcome and providing the 
classification outcome to the primary 
classification rule and secondary 
classification rule selection step (74) ; 

(g) if the confidence of the secondary defect 
classification is not high, questioning the 
classification outcome (72) and sending 
potential defects to a novelty defect 
detection step (76) ; 

(h) performing the novelty defect detection step 

(76) to define artifacts (78) or potential 
new defect type to provide information for 
a truth inquiry (80); and 

(i) performing a truth inquiry (80) to update a 
classification rule database for use by the 
primary classification rule and secondary 
classification rule selection (82) . 

The method of claim 1 further comprising the 
steps of : 
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(a) providing potential defects (60) and 
fabrication information (63) ; 

(b) selecting a multiple stage classification 
rule (61) from a defect knowledge database' 
from multiple products with different 
process cycles (86) ; 

(c) performing multiple stage classification 
having at least one classification result 
and at least one confidence (61) ; 

(d) checking the at least one confidence (64, 
70) and if the confidence is high, defining 
a classification outcome (68, 70) and 
providing the classification outcome (68, 
70) to the multiple stage classification 
rule selection step (84) ; 

(e) if the at least one confidence is not high, 
questioning the classification outcome and 
sending the potential defects to the novelty 
defect detection module (76) ; 

(f) performing a novelty defect detection step 
to define artifacts (78) or potential new 
defect type to provide information for a 
truth inquiry (80) ; and 

(g) performing the truth inquiry (80) to update 
a classification rule database (86) for use 
by the multiple stage classification rule 
selection (61) . 

The method of claim 2 wherein the novelty defect 
detection step (76) comprises: 

(a) providing potential novel defects (104) ; 

(b) performing artifact rejection (90) of 
potential novel defects and generating an 
artifact database (94) and potential novel 
defect database (92) ; and 
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(c) performing a clustering of a potential novel 
defect database (98) to provide defect 
clusters (100) and other objects (102), and 
providing a truth determination (96) of the 
defect clusters (100) to update the 
potential novel defect database (92) . 

The method of claim 2 wherein the step of 
performing automatic classification rule 
generation comprises: 

(a) providing defect samples with truth labels 
where features are selected to form a 
feature space (106) ; 

(b) performing sample editing on the defect 
knowledge database to generate an edited 
result set (108) ; 

(c) performing a sample condensing on the edited 
result set to remove redundant samples 
(110) ; 

(d) performing adaptive learning on the feature 
space to identify a new prototype (112, 
114) ; and 

( e) generating high and low confidence 
thresholds for the new prototype with new 
classification rules (116) . 

The method of claim 2 wherein the step of 
performing a truth inquiry further comprises the 
steps of : 

(a) perform editing and condensing on a defect 
cluster to select a subset of defects (120) ; 
and 

(b) performing a similarity detection step (124) 
to compare known defects to representative 
unknown defects . 
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The method of claim 2 wherein the defect 
knowledge database comprises a plurality of 
products wherein each product has a processing 
layer where each processing layer has a plurality 
of defect types wherein each defect type has a 
plurality of rules wherein each rule has a 
prototype (Figure 7) . 

The method of claim 1 wherein the defects are 
semiconductor wafer defects. 

The method of claim X wherein the defects are 
liquid crystal display defects. 

A method for incremental concurrent learning 
comprising the steps of : 

(a) providing potential defects (60) and 
fabrication information (63) ; 

(b) providing a primary classification rule and 
secondary classification rule selection (84) 
from a defect knowledge database (86) from 
multiple products with different process 
cycles ; 

(C ) performing a truth inquiry (80) to update a 
classification rule database for use by the 
primary classification rule and secondary 
classification rule selection step (82); 

(d) performing primary defect classification 
having a classification output (62) ; 

(e) checking a confidence of the classification 
to decide whether to perform a secondary 
defect classification (64) ; 

(f) checking the confidence of a secondary 
defect classification (70) and if the 
confidence of the second defect 
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classification is high, defining the 
classification outcome and providing the 
classification outcome to the primary 
classification rule and secondary 
5 classification rule selection step (74) ; 

(g) if the confidence of the secondary defect 
classification is not high, questioning the 
classification outcome (72) and sending the 
potential defects to a novelty defect 

10 detection step (76) ; and 

(h ) performing the novelty defect detection step 

(76) to define artifacts (78) or a potential 
new defect type to provide information for 
the truth inquiry (80) . 

11. The method of claim 10 wherein the novelty 
detection step (76) comprises: 

(a) providing potential novel defects (104); 

(b) performing artifact rejection of potential 
novel defects and generating an artifact 
database (94) and potential novel defect 
database (92) ; and 

( C ) performing a clustering of a potential novel 
defect database (98) to provide defect 
clusters (100) and other objects (102) 
providing a truth determination (96) of the 
defect clusters (100) to update the 
potential novel defect database (92) - 

The method of claim 10 wherein the step of 
automatic classification rule generation further 
comprises the steps of: 

(a) providing defect samples with truth labels 
where features are selected (10 6) ; 
35 (b ) performing a sample editing on - the defect 



20 



25 



30 12 



WO 99/01842 



PCT/US98/13670 



10 



20 



25 



-35- 

knowledge database (108) ; 

(c) performing a condensing of an output of the 
sample editing (110) ; 

(d) performing adaptive learning step for the 
initialization of prototypes (112) ; 

(e) performing adaptive learning (114); and 

(f) generating high and low confidence 
thresholds for the classification rules to 
be generated (116) . 



13. The method of claim 10 wherein the step of 
performing a truth inquiry further comprises the 
steps of: 

(a) editing and condensing a defect cluster 

15 (120) ; and 

(b) performing a similarity detection step (124) 
to compare known defects to representative 
unknown defects . 



14 . The method of claim 10 wherein the defect 
knowledge database comprises a plurality of 
products wherein each product has a processing 
layer where each processing layer has a plurality 
of defect types wherein each defect type has a 
plurality of rules wherein each rule has a 
prototype (Figure 7) . 

15. The method of claim 10 wherein the defects are 
semiconductor wafer defects. 

16. The method of claim 10 wherein the defects are 
liquid crystal display defect's. 

17. a method of incremental concurrent learning 
35 comprising the steps of: 
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ta) providing potential defects (60) and 

fabrication information (63) ; 
(b) providing a multiple classification rule 
selection (61) from a defect knowledge 
database from multiple products with 
different process cycles (86) ; 
performing a truth inquiry (80) to update a 
classification rule database for use by the 
multiple classification rule selection (82) ; 
performing multiple stage classification 
having at least one classification output 
and at least one confidence output (61) ; 
checking the at least one confidence (64, 
70) and if the confidence of the at least 
one classification is high, defining the 
classification outcome (68, 70) and 
providing the classification outcome (68, 
70) to the multiple classification rule 
selection step (84) ; 

if the at least one confidence is not high, 
questioning the classification outcome and 
sending the potential defects to the novelty 
defect detection module (76) ,-. and 
(g) performing a novelty defect detection step 
to define artifacts (78) or potential new 
defect type to provide information for the 
truth inquiry (80) . 

18. The method of claim 17 wherein the novelty 
detection step (76) comprises: 

(a) providing potential novel defects (104); 

(b) performing artifact rejection (90) of 
potential novel defects and generating an 
artifact database (94) and potential novel 
defect database (92) ; and 
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(c) performing a clustering of a potential novel 
defect database (98) to provide use of 
clusters (100) and other objects (102) 
providing a truth determination (96) of the 
defect clusters (100) to update the 
potential novel defect database (92). 

The method of claim 17 wherein the automatic 
classification rule generation comprises: 
10 (a) providing defect samples with truth labels 

where features are selected (106); 

(b ) performing a sample editing on the defect 
knowledge database (108) ; 

(c) performing a condensing of an output of the 
sample editing (110) ; 

(d) performing adaptive learning step for the 
initialization of prototypes (112) ; 

(e) performing adaptive learning with a LVQ 
method (114) ; and 

(f) generating high and low confidence 
thresholds for the rules to be generated 
(116) . 
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The method of claim 17 wherein the step of 
performing a truth inquiry further comprxses the 



steps of 



(a) 



editing and condensing a defect cluster 



(120) ; and 
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(b ) performing a similarity detection step (124) 
to compare known defects to representative 
unknown defects. 

The method of claim 17 wherein the defect 
knowledge database comprises a plurality of 
products wherein each product has a. processing 
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layer where each processing layer has a plurality 
of defect types wherein each defect type has a 
plurality of rules wherein each rule has a 
prototype (Figure 7) . 

The method of claim 17 wherein the defects are 
semiconductor wafer defects. 

The method of claim 17 wherein the defects are 
liquid crystal display defects. 

24 . A method of incremental concurrent learning 
comprising the steps of: 

(a) providing potential defects (60) and 
fabrication information (63); 

(b) providing a classification rule selection 

(84) from a defect knowledge database (86) 
from multiple products with different 
process cycles; 
20 (O performing a truth inquiry (80) to update a 

classification rule database (86) for use by 
the classification rule selection (61) ; 

(d) performing a single stage classification 
having at least one classification output 
and at least one confidence output (62); 

(e) checking the at least one confidence (64) 



25 



one 



30 



35 



and if the confidence of the at least 
classification is high (68), defining the 
classification outcome and providing the 
classification outcome to the classification 
rule selection step (84) ; 
(f ) if the at least one confidence is not high, 
questioning the classification outcome (72) 
and sending the potential defects to the 
novelty defect detection module (76) ; and 
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(g) performing a novelty defect detection step 
(76) to define artifacts (78) or potential 
new defect type to provide information for 
the truth inquiry (80) . 

25. The method of claim 24 wherein the novelty 
detection step comprises: 

(a) providing potential novel defects (104) ; 

(b) performing artifact rejection (90) of 
potential novel defects and generating an 
artifact database (94) and potential novel 
defect database (92) ; and 

(c) performing a clustering of a potential novel 
defect database (98) to provide use of 
clusters (100) and other objects (102) 
providing a truth determination (96) of the 
defect clusters (100) to update the 
potential novel defect database (92) . 

The method of claim 24 wherein the automatic 
classification rule generation comprises: 

(a) providing defect samples with truth labels 
where features are selected (106); 

(b) performing a sample editing on the defect 
knowledge database (108) ; 

(c) performing a condensing of an output of the 
sample editing (110) ; 

(d) performing adaptive learning step for the 
initialization of prototypes (112) ; and 

30 (e) performing adaptive learning with a LVQ 

method (114) ; 
(f ) generating high and low confidence 
thresholds for the rules to be generated 
(116) . 
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27 The method of claim 24 wherein the step of 
performing a truth inquiry further comprises the 
steps of : 

(a) editing and condensing a defect cluster 
(120) ; and 

(b ) performing a similarity detection step (124) 
to compare known defects to representative 
unknown defects. 
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The method of claim 24 wherein the defect 
knowledge database comprises a plurality of 
products wherein each product has a processing 
layer where each processing layer has a plurality 
of defect types wherein each defect type has a 
plurality of rules wherein each rule has a 
prototype (Figure 7) . 

The method of claim 24 wherein the defects are 
semiconductor wafer defects. 

The method of claim 24 wherein the defects are 
liquid crystal display defects. 
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